3 <110> APPLICANT: Tsang et al.

5 <120> TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR DETECTING LARVAL TAENIA SOLIUM 
7 <130> FILE REFERENCE: 6395-62068 
9 <140> CURRENT APPLICATION NUMBER: 10/048,146B
10 <141> CURRENT FILING DATE: 2000-08-03 

12 <150> PRIOR APPLICATION NUMBER: US 60/147,318 

13 <151> PRIOR FILING DATE: 1999-08-03 

15 <150> PRIOR APPLICATION NUMBER: PCT/US00/21173

16 <151> PRIOR FILING DATE: 2000-08-03 
18 <160> NUMBER OF SEQ ID NOS: 9

20 <170> SOFTWARE: PatentIn version 3.1

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 2153 

24 <212> TYPE: DNA 

25 <213> ORGANISM: Taenia solium 

27 <220> FEATURE: 

28 <221> NAME/KEY: CDS 

29 <222> LOCATION: (145)..(531)

30 <223> OTHER INFORMATION: 
W--> 33 <400> 1

34 ctgcagtgaa gttgacaagt agttgaccat ttacggaaca tcaatggagg acactttggt 60 
36 agggaaagca tacgataaac ataaaccaat gctggttata taagagacga tctcggctac 120 
38 acttgtaact gaacaacctg taga atg cgt gcc tac att gtg ctt ctc gct 171
39                            Met Arg Ala Tyr Ile Val Leu Leu Ala
40                            1               5

42 ctc act gtt ttc gta gtg acg gtg tcg gcc gag tgg gtg ccc att tcg 219
43 Leu Thr Val Phe Val Val Thr Val Ser Ala Glu Trp Val Pro Ile Ser
44 10                  15                  20                  25

46 agt gtc cac ata gcc tca tgc aaa agc cac tac atg ttc caa tta aaa 267
47 Ser Val His Ile Ala Ser Cys Lys Ser His Tyr Met Phe Gln Leu Lys
48                 30                  35                  40

50 cgc ttt ttt gcc ttt agg aaa aac aaa ccg aaa gat gtt gca aat agt 315
51 Arg Phe Phe Ala Phe Arg Lys Asn Lys Pro Lys Asp Val Ala Asn Ser
52             45                  50                  55

54 acg aaa aaa ggg ata gaa tat gtc cac gaa ttc ttc cac gaa gac ccg 363
55 Thr Lys Lys Gly Ile Glu Tyr Val His Glu Phe Phe His Glu Asp Pro
56         60                  65                  70

58 att ggt aaa caa att gct caa ctc gca aag gaa tgg aag gaa gca atg 411
59 Ile Gly Lys Gln Ile Ala Gln Leu Ala Lys Glu Trp Lys Glu Ala Met
60     75                  80                  85

62 ttg gaa ggt agg ttt tgg tgt ttt ctg tca gaa gaa aat tat cta ttc 459
63 Leu Glu Gly Arg Phe Trp Cys Phe Leu Ser Glu Glu Asn Tyr Leu Phe
64 90                  95                  100                 105
66 att cat cta gac aaa ggc aaa ata cgg acg tca ctg gtt gag cac tgc 507
67 Ile His Leu Asp Lys Gly Lys Ile Arg Thr Ser Leu Val Glu His Cys
68                 110                 115                 120

70 aaa ggt cct aag aaa aaa act gct taacttgtca actttcatgc gttcttctct 561
71 Lys Gly Pro Lys Lys Lys Thr Ala
72             125

74 tcactaataa atgctcatta ataagaaagc tgccttttgc aagatcaacg agggccatag 621
76 actgtgaggg ttatagccta aggttatggg gtgaaatgag ataggaattg agcatttgag 681
78 aagttactaa tttaaattga aagccgcatt tcttctgcaa ttgacgtgtg atggttagcg 741
80 aaaccaagtg aagcacgacc tcttgagtcg tttcaacagc cgccagtggt ttcaccagtg 801
82 gcttcaccag tgggtagact ggtttgtcac acatgcgagg tacggtcaga gggctaacag 861
84 gtgtggtgga ggggccaaca cgtgtaagac aagcagttcc ccttctctgt cgtgaggcac 921
86 actcagcacc cacctcgttt acttctccct tgacgactgt aatgcatttg gggtcaccat 981
88 gcccccgcca agttgaaggc actgatgaca tttgtaccat atcaccgata agtattaact 1041
90 cttccacttc ccagattttg aggtcaggcg atcctactga ctcggtgtag ccccatggtg 1101
92 gtccatgctc tgcaccattc gctgttcagt ggagcatcca cctagacggc caaccaatct 1161
94 cgcctccctt ctcctgtgct caagatgtgc gtcggtgaga tttggagggt ctgatcacca 1221
96 tactaaccac gtaggtttca tcatctctaa gaagcaccac ttcttgaggt cgcattgtgt 1281
98 accaccagcc ggtgtaatca agagtgactt tcgcgtcacc cctaagaagg ctatagatct 1341
100 gcaagtcagc gcaatagctt cagccatgct gactaaaatg tgtaagggac cagtagctct 1401
102 agcccaacac aagtggagct aataatgggc ttccccagat acatgaatcc caaatcggtg 1461
104 agcatgggcc atgaatatgg ccttctgagt cttccttgaa tgcaaacgaa ggcatagcac 1521
106 gagggtagga tgagtgtaca gaaaacagcg aggcaacgaa tctactggca tggccctgat 1581
108 gccaccccgc ccagctaggg tagtttggcc acctcagtcc ttaatcgaat gcggcagtca 1641
110 gaacaaacaa agtattacat agccacactc ttcttttgag cgtcgtcctc gacgctcctt 1701
112 tcgacacacc tcccgcatca gccaccacaa agtaatcagt actggggaga cacccacgag 1761
114 ctaaccgtgc cagtcatgga aaatttgacg gcaactgagg agatgcctga ccccctttgg 1821
116 cagttcgaat gctgcccgtg gtcaaactcc tgcatcagcc atcacctacg attcaaacat 1881
118 cctagtcgcc aaattttcgt gaaccctcta aaattttcgt gcactctcaa gacacttcca 1941
120 actgacttag agctttttca tttggtgaga acacgtaaaa gcttcaagta aacaacaggc 2001
122 aacgatttca ctttgatgct ctcaccatca attctcttgt atgtgccacc accttaaacc 2061
124 ctccctgacc acttccactc tctctctctc cctaaataac aacacttgga agcatgaatg 2121
126 gtgtctgtca aagttacacc cctagactgc ag 2153

129 <210> SEQ ID NO: 2

130 <211> LENGTH: 129

131 <212> TYPE: PRT

132 <213> ORGANISM: Taenia solium
134 <400> SEQUENCE: 2

136 Met Arg Ala Tyr Ile Val Leu Leu Ala Leu Thr Val Phe Val Val Thr
137 1               5                   10                  15

140 Val Ser Ala Glu Trp Val Pro Ile Ser Ser Val His Ile Ala Ser Cys
141             20                  25                  30

144 Lys Ser His Tyr Met Phe Gln Leu Lys Arg Phe Phe Ala Phe Arg Lys
145         35                  40                  45

148 Asn Lys Pro Lys Asp Val Ala Asn Ser Thr Lys Lys Gly Ile Glu Tyr
149     50                  55                  60

152 Val His Glu Phe Phe His Glu Asp Pro Ile Gly Lys Gln Ile Ala Gln
153 65                  70                  75                  80

156 Leu Ala Lys Glu Trp Lys Glu Ala Met Leu Glu Gly Arg Phe Trp Cys
157                 85                  90                  95

160 Phe Leu Ser Glu Glu Asn Tyr Leu Phe Ile His Leu Asp Lys Gly Lys
161             100                 105                 110

164 Ile Arg Thr Ser Leu Val Glu His Cys Lys Gly Pro Lys Lys Lys Thr
165         115                 120                 125

168 Ala

172 <210> SEQ ID NO: 3 

173 <211> LENGTH: 298 

174 <212> TYPE: DNA 

175 <213> ORGANISM: Taenia solium 

177 <220> FEATURE: 

178 <221> NAME/KEY: CDS 

179 <222> LOCATION: (3)..(224)

180 <223> OTHER INFORMATION: 
W--> 183 <400> 3

184 ta ttc gta gtg gcg gtt tcg gcc gag aaa aac aaa ccg aag tgt gat 47
185    Phe Val Val Ala Val Ser Ala Glu Lys Asn Lys Pro Lys Cys Asp
186    1               5                   10                  15

188 gca aat agt act aag aaa gag ata gaa tat atc cac aat tgg ttt ttc 95
189 Ala Asn Ser Thr Lys Lys Glu Ile Glu Tyr Ile His Asn Trp Phe Phe
190                 20                  25                  30

192 cat gat gac ccg att gga aaa caa att gct caa ctc gca aag gac tgg 143
193 His Asp Asp Pro Ile Gly Lys Gln Ile Ala Gln Leu Ala Lys Asp Trp
194             35                  40                  45

196 aat gaa aca gtg cag gaa gcc aaa ggc aaa ttt tgg gcg tca ctg gct 191
197 Asn Glu Thr Val Gln Glu Ala Lys Gly Lys Phe Trp Ala Ser Leu Ala
198         50                  55                  60

200 gag tac tgc aga ggt ctg aag aac aaa act gct taacttgtca actttcatgc 244
201 Glu Tyr Cys Arg Gly Leu Lys Asn Lys Thr Ala
202     65                  70

204 gttcttctct tcaccaataa atgctgatta acaagaaaaa aaaaaaaaaa aaaa 298

207 <210> SEQ ID NO: 4 

208 <211> LENGTH: 74 

209 <212> TYPE: PRT 

210 <213> ORGANISM: Taenia solium 
212 <400> SEQUENCE: 4 

214 Phe Val Val Ala Val Ser Ala Glu Lys Asn Lys Pro Lys Cys Asp Ala
215 1               5                   10                  15

218 Asn Ser Thr Lys Lys Glu Ile Glu Tyr Ile His Asn Trp Phe Phe His
219             20                  25                  30

222 Asp Asp Pro Ile Gly Lys Gln Ile Ala Gln Leu Ala Lys Asp Trp Asn
223         35                  40                  45

226 Glu Thr Val Gln Glu Ala Lys Gly Lys Phe Trp Ala Ser Leu Ala Glu
227     50                  55                  60

230 Tyr Cys Arg Gly Leu Lys Asn Lys Thr Ala
231 65                  70

234 <210> SEQ ID NO: 5 

235 <211> LENGTH: 294 

236 <212> TYPE: DNA 
237 <213> ORGANISM: Taenia solium

239 <220> FEATURE: 

240 <221> NAME/KEY: CDS 

241 <222> LOCATION: (3)..(221)

242 <223> OTHER INFORMATION: 
W--> 245 <400> 5

246 tt ttc gta gtg gcg gtg tcg gcc gag gaa act aaa cca gag gac gtg 47
247    Phe Val Val Ala Val Ser Ala Glu Glu Thr Lys Pro Glu Asp Val
248    1               5                   10                  15

250 gta aag aat att aag aaa ggg atg gaa gtt gtc tac aaa ttt ttc tac 95
251 Val Lys Asn Ile Lys Lys Gly Met Glu Val Val Tyr Lys Phe Phe Tyr
252                 20                  25                  30

254 gaa gac ccg ttg gga aag aaa ata gct caa ctc gca aag gac tgg aag 143
255 Glu Asp Pro Leu Gly Lys Lys Ile Ala Gln Leu Ala Lys Asp Trp Lys
256             35                  40                  45

258 gaa gca atg ttg gaa gcc aga agc aaa gtg cgg gcg tca ctg gct gag 191
259 Glu Ala Met Leu Glu Ala Arg Ser Lys Val Arg Ala Ser Leu Ala Glu
260         50                  55                  60

262 tac atc aga ggt ctc aag aac gaa gct gct taacttgtca actttcatgc 241
263 Tyr Ile Arg Gly Leu Lys Asn Glu Ala Ala
264     65                  70

266 gttcttctct tcactaataa atgctcatta ataagaaaaa aaaaaaaaaa aaa 294

269 <210> SEQ ID NO: 6 

270 <211> LENGTH: 73 

271 <212> TYPE: PRT 

272 <213> ORGANISM: Taenia solium 
274 <400> SEQUENCE: 6

276 Phe Val Val Ala Val Ser Ala Glu Glu Thr Lys Pro Glu Asp Val Val
277 1               5                   10                  15

280 Lys Asn Ile Lys Lys Gly Met Glu Val Val Tyr Lys Phe Phe Tyr Glu
281             20                  25                  30

284 Asp Pro Leu Gly Lys Lys Ile Ala Gln Leu Ala Lys Asp Trp Lys Glu
285         35                  40                  45

288 Ala Met Leu Glu Ala Arg Ser Lys Val Arg Ala Ser Leu Ala Glu Tyr
289     50                  55                  60

292 Ile Arg Gly Leu Lys Asn Glu Ala Ala
293 65                  70

296 <210> SEQ ID NO: 7 

297 <211> LENGTH: 6 

298 <212> TYPE: PRT 

299 <213> ORGANISM: Taenia solium 
301 <400> SEQUENCE: 7

303 Ile Ala Gln Leu Ala Lys

304 1               5

307 <210> SEQ ID NO: 8 

308 <211> LENGTH: 24 

309 <212> TYPE: PRT 

310 <213> ORGANISM: Taenia solium 
312 <220> FEATURE: 
313 <221> NAME/KEY: variant

314 <222> LOCATION: (7)..(8)

315 <223> OTHER INFORMATION: Amino acid at position 7 may also be valine

318 <220> FEATURE:

319 <221> NAME/KEY: site

320 <222> LOCATION: (21)..(22)

321 <223> OTHER INFORMATION: Asparagine at position 21 is an amino acid insertion

324 <220> FEATURE:

325 <221> NAME/KEY: variant

326 <222> LOCATION: (14)..(15)

327 <223> OTHER INFORMATION: Amino acid at position 14 may also be glycine

330 <220> FEATURE:

331 <221> NAME/KEY: variant

332 <222> LOCATION: (18)..(19)

333 <223> OTHER INFORMATION: Amino acid at position 18 may also be valine

336 <220> FEATURE:

337 <221> NAME/KEY: variant

338 <222> LOCATION: (19)..(20)

339 <223> OTHER INFORMATION: Amino acid at position 19 may also be histidine

342 <220> FEATURE:

343 <221> NAME/KEY: variant

344 <222> LOCATION: (20)..(21)

345 <223> OTHER INFORMATION: Amino acid at position 20 may also be arginine

348 <400> SEQUENCE: 8

350 Lys Asn Lys Pro Lys Asp Asp Ala Ala Ser Thr Lys Lys Glu Ile Glu
351 1               5                   10                  15

354 Tyr Ile Trp His Asn Phe Phe Phe
355             20

358 <210> SEQ ID NO: 9 

359 <211> LENGTH: 13 

360 <212> TYPE: PRT 

361 <213> ORGANISM: Taenia solium 

363 <220> FEATURE: 

364 <221> NAME/KEY: variant 

365 <222> LOCATION: (5)..(6)

366 <223> OTHER INFORMATION: Amino acid at position 5 may also be isoleucine

369 <220> FEATURE:

370 <221> NAME/KEY: variant

371 <222> LOCATION: (12)..(13)

372 <223> OTHER INFORMATION: Amino acid at position 12 may also be aspartic acid

375 <220> FEATURE:

376 <221> NAME/KEY: variant

377 <222> LOCATION: (7)..(9)

378 <223> OTHER INFORMATION: Amino acid at position 7 may also be asparagine

381 <220> FEATURE:

382 <221> NAME/KEY: site

383 <222> LOCATION: (8)..(9)

384 <223> OTHER INFORMATION: Tryptophan at position 8 is an amino acid insertion 
387 <400> SEQUENCE: 9 
L:33 M:258 W: Mandatory Feature missing, <223> Blank for SEQ#: 1, Line#: 30
L:183 M:258 W: Mandatory Feature missing, <223> Blank for SEQ#: 3, Line#: 180
L:245 M:258 W: Mandatory Feature missing, <223> Blank for SEQ#: 5, Line#: 242
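The three warnings above each point at a <223> field left blank inside a <220> feature block (listing lines 30, 180 and 242). The following is a minimal Python sketch of that kind of check, not the validator that produced the report. It assumes every listing line keeps its leading line number and that "blank" simply means no text after "OTHER INFORMATION:". The function name blank_223_fields is hypothetical.

```python
import re

# Matches "<line number> <NNN> rest", e.g. "30 <223> OTHER INFORMATION:".
LINE = re.compile(r"^\s*(\d+)\s+<(\d{3})>\s*(.*)$")


def blank_223_fields(listing):
    """Return (seq_id, listing_line_number) for every empty <223> field.

    Assumption: a <210> line ends in the sequence number and applies to
    all following fields until the next <210> line.
    """
    hits = []
    seq_id = None
    for raw in listing.splitlines():
        match = LINE.match(raw)
        if match is None:
            continue  # sequence data, residue numbering, blank lines
        line_no = int(match.group(1))
        code = match.group(2)
        rest = match.group(3)
        if code == "210":
            # "SEQ ID NO: 3" -> 3
            seq_id = int(rest.split(":")[-1])
        elif code == "223":
            # Keep only the text after the "OTHER INFORMATION:" label.
            text = rest.split(":", 1)[1] if ":" in rest else rest
            if not text.strip():
                hits.append((seq_id, line_no))
    return hits


sample = "\n".join([
    "22 <210> SEQ ID NO: 1",
    "27 <220> FEATURE:",
    "28 <221> NAME/KEY: CDS",
    "29 <222> LOCATION: (145)..(531)",
    "30 <223> OTHER INFORMATION: ",
    "307 <210> SEQ ID NO: 8",
    "315 <223> OTHER INFORMATION: Amino acid at position 7 may also be valine",
])
print(blank_223_fields(sample))
```

Run on the full listing, a check of this shape would be expected to report the CDS features of SEQ ID NOS: 1, 3 and 5, whose <223> fields carry no text.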